1.0 Introduction

1.1 - Location of Zambia

Zambia is a landlocked country located in South-Central Africa and is bordered by 8 countries, namely the Democratic Republic of the Congo to the north, Tanzania to the northeast, Malawi to the east, Mozambique to the southeast, Zimbabwe and Botswana to the south, Namibia to the southwest, and Angola to the west.

1.2 - Geography and Climate

  • Zambia is the 38th largest country in the world at 290,587 sq.mi.
  • The country consists mostly of high plateaus, with hills and mountains dissected by river valleys. The average elevation of 1,200 m above sea level gives the land a moderate tropical climate.
  • Average monthly temperatures above 20 degrees Celsius persist for the majority of the year.
  • Two major river basins drain Zambia: the Congo basin to the north and the Zambezi basin towards the centre, west and south of the country.
  • Victoria Falls lies on the border between Zambia and Zimbabwe.

1.3 - A Brief Timeline of COVID-19 in Zambia

March, 2020

  • 17th - Government shuts down all educational institutions and imposes restrictions on foreign travel.
  • 18th - First 2 cases are reported.

April, 2020

  • 2nd - First death is reported.

December, 2020

  • 30th - 501.V2 (Beta) variant is first detected in Zambia.

July, 2021

  • 1st - Maximum number of deaths in a single day (up to present day) is recorded at 72.

December, 2021

  • 4th - Omicron variant is first detected in 3 people.
  • 30th - Maximum number of confirmed cases in a single day (up to present day) is recorded at 5555.

2.0 Exploratory Data Analysis

2.1 - Structure of the Dataset

The initial dataset, obtained by filtering the “coronavirus” dataset for Zambia only, consisted of 2652 observations recording the number of confirmed cases, recovered cases and deaths per day over the period 22/01/2020 - 23/06/2022.

However, recovered cases were not recorded beyond 05/08/2021, so those observations were removed during the data cleaning process. The final dataset considered for analysis consists of 2329 observations, with the structure shown below.

As such, recovered cases were not considered in the analysis proper.
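
The cleaning step described above can be sketched roughly as follows. This is a minimal illustration on a small made-up data frame, not the code used for this report: the names `toy_cov` and `toy_clean`, the type label "recovery" and the hard-coded cutoff date are all assumptions based on the description in this section.

```r
# Toy long-format data with date / type / cases columns.
# All values here are made up for illustration.
dates <- seq(as.Date("2021-08-03"), as.Date("2021-08-08"), by = "day")
toy_cov <- data.frame(
  date  = rep(dates, times = 3),
  type  = rep(c("confirmed", "death", "recovery"), each = length(dates)),  # assumed labels
  cases = seq_len(3 * length(dates)),
  stringsAsFactors = FALSE
)

# Assumed cutoff: recovered cases are not recorded beyond this date.
recovery_cutoff <- as.Date("2021-08-05")

# Drop recovered rows dated after the cutoff and keep everything else.
is_stale_recovery <- toy_cov$type == "recovery" & toy_cov$date > recovery_cutoff
toy_clean <- toy_cov[!is_stale_recovery, ]

nrow(toy_cov)    # 18 rows: 6 days x 3 types
nrow(toy_clean)  # 15 rows: the 3 recovered rows after the cutoff are dropped
```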

knitr::kable(head(zam_cov_full))
date province country lat long type cases uid iso2 iso3 code3 combined_key population continent_name continent_code
2020-01-22 NA Zambia -13.1339 27.84933 confirmed 0 894 ZM ZMB 894 Zambia 18383956 Africa AF
2020-01-23 NA Zambia -13.1339 27.84933 confirmed 0 894 ZM ZMB 894 Zambia 18383956 Africa AF
2020-01-24 NA Zambia -13.1339 27.84933 confirmed 0 894 ZM ZMB 894 Zambia 18383956 Africa AF
2020-01-25 NA Zambia -13.1339 27.84933 confirmed 0 894 ZM ZMB 894 Zambia 18383956 Africa AF
2020-01-26 NA Zambia -13.1339 27.84933 confirmed 0 894 ZM ZMB 894 Zambia 18383956 Africa AF
2020-01-27 NA Zambia -13.1339 27.84933 confirmed 0 894 ZM ZMB 894 Zambia 18383956 Africa AF
summary(zam_cov_full)
##       date              province           country               lat        
##  Min.   :2020-01-22   Length:2329        Length:2329        Min.   :-13.13  
##  1st Qu.:2020-08-03   Class :character   Class :character   1st Qu.:-13.13  
##  Median :2021-02-13   Mode  :character   Mode  :character   Median :-13.13  
##  Mean   :2021-02-27                                         Mean   :-13.13  
##  3rd Qu.:2021-09-05                                         3rd Qu.:-13.13  
##  Max.   :2022-06-23                                         Max.   :-13.13  
##       long           type               cases             uid     
##  Min.   :27.85   Length:2329        Min.   :   0.0   Min.   :894  
##  1st Qu.:27.85   Class :character   1st Qu.:   0.0   1st Qu.:894  
##  Median :27.85   Mode  :character   Median :   9.0   Median :894  
##  Mean   :27.85                      Mean   : 222.7   Mean   :894  
##  3rd Qu.:27.85                      3rd Qu.: 122.0   3rd Qu.:894  
##  Max.   :27.85                      Max.   :5555.0   Max.   :894  
##      iso2               iso3               code3     combined_key      
##  Length:2329        Length:2329        Min.   :894   Length:2329       
##  Class :character   Class :character   1st Qu.:894   Class :character  
##  Mode  :character   Mode  :character   Median :894   Mode  :character  
##                                        Mean   :894                     
##                                        3rd Qu.:894                     
##                                        Max.   :894                     
##    population       continent_name     continent_code    
##  Min.   :18383956   Length:2329        Length:2329       
##  1st Qu.:18383956   Class :character   Class :character  
##  Median :18383956   Mode  :character   Mode  :character  
##  Mean   :18383956                                        
##  3rd Qu.:18383956                                        
##  Max.   :18383956

2.2 - Zambia

Some key takeaways from Fig. 2 -

  • The time series plot for daily COVID-19 cases seems to indicate some seasonal pattern based on the data at hand, with increasing variance as time progresses. The plot for daily COVID-19 deaths does not show as clear a pattern as that of daily cases. However, both plots hit their peaks and valleys at roughly the same times, which suggests some correlation between the two at first glance.

Peaks (roughly) - May 2020, July 2020, Jan 2021, Jun 2021, late Dec 2021-early Jan 2022

Valleys (roughly) - Jun 2020, Oct/Nov 2020, May 2021, Nov 2021

  • The 1st death was recorded on 2020-04-02, roughly 2 weeks after the 1st confirmed case on 2020-03-18.
  • The maximum number of confirmed cases on a single day was 5555, on 2021-12-30, and the maximum number of deaths on a single day was 72, on 2021-07-01.
  • Total confirmed COVID-19 infections account for roughly 1.8% of the population, and approximately 1.2% of those confirmed infections resulted in death.
  • Zambia has had approximately 80 times as many confirmed cases as deaths.
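
The percentages above follow from simple ratios. The sketch below assumes that the infection rate is total confirmed cases divided by population and that the death rate is total deaths divided by total confirmed cases. The function names are illustrative, and the case total is the rounded figure of approximately 325,000 quoted in Section 2.3, not an exact count.

```r
# Infection rate: share of the population with a confirmed infection (in %).
infection_rate <- function(total_cases, population) {
  100 * total_cases / population
}

# Death rate: share of confirmed cases that resulted in death (in %).
death_rate <- function(total_deaths, total_cases) {
  100 * total_deaths / total_cases
}

population  <- 18383956  # population column of the dataset
total_cases <- 325000    # rounded, approximate total

round(infection_rate(total_cases, population), 1)  # 1.8

# A death rate of about 1.2% implies roughly 1 / 0.012 confirmed cases per death.
round(1 / 0.012)  # 83, in line with "approximately 80 times"
```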

Note - The proportions of the stacked area graphs in the above figure should not be read directly off the plot, as the cumulative totals on the y axis have been scaled by square root in order to better visualize the cumulative deaths plot.
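
To see why a square-root scale changes the apparent proportions, consider the roughly 80-to-1 ratio of cases to deaths noted above. The following is illustrative arithmetic only: the totals are round placeholders chosen to have that ratio, not values from the dataset, and the comparison ignores the stacking of the areas.

```r
# Placeholder totals chosen only to have an 80:1 ratio (not dataset values).
cases_total  <- 80000
deaths_total <- 1000

cases_total / deaths_total              # 80: ratio on the original scale
sqrt(cases_total) / sqrt(deaths_total)  # about 8.9: ratio of heights on a square-root axis
```

Under these assumptions, deaths appear roughly nine times larger relative to cases than they would on a linear axis.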

2.3 - Zambia vs Neighbouring Countries

Five countries in the southern region of Africa were selected in order to compare COVID-19 data on a regional level. Four of them share a border with Zambia (Angola, Mozambique, Namibia and Zimbabwe). The fifth is South Africa, a major political and economic player in the region that was severely affected during the COVID-19 pandemic.

  • A pairwise comparison of the time series plots of daily confirmed COVID-19 cases in Fig. 4 shows that Zambia’s peaks and valleys roughly coincide with those of all the countries compared except Angola. Angola appears to lag behind the others and, multiple times before 2022, peaks while the other countries are in a valley and vice versa.
  • South Africa shows a significantly greater number of confirmed cases throughout the plot, with the other 5 countries being more closely grouped together.
  • South Africa also recorded its first confirmed case in early March 2020, followed by Namibia in mid March and then Zambia, Zimbabwe, Angola and Mozambique, in that order, in mid to late March of the same year.

By Fig. 5,

  • The discrepancy between South Africa and the rest with respect to confirmed cases is even clearer than in Fig. 4: the total cases of the other 5 countries combined still amount to less than a third of South Africa’s (approximately 1.1 million vs 4 million).

  • Zambia shows the 2nd highest count of total cases amongst those considered at approximately 325,000, around 70,000 more than the next highest, Zimbabwe.

  • Angola has the lowest count of total cases at approximately 100,000, by a relatively sizeable margin compared with the differences between the other non-South African countries (a gap of approximately 70,000 between itself and the next lowest, Namibia).
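
Totals such as those compared above can be obtained by summing daily counts per country. The sketch below uses a made-up data frame with two hypothetical countries, "A" and "B", so none of its values come from the dataset. The last lines check the "less than a third" statement against the approximate totals quoted in the text.

```r
# Toy daily counts for two hypothetical countries (all values are made up).
toy_daily <- data.frame(
  country = c("A", "A", "A", "B", "B", "B"),
  type    = "confirmed",
  cases   = c(10, 20, 30, 1, 2, 3)
)

# Total confirmed cases per country.
totals <- aggregate(cases ~ country, data = toy_daily, FUN = sum)
totals$cases  # 60 for A, 6 for B

# Share check using the approximate totals quoted in the text.
others_combined    <- 1.1e6  # other 5 countries, approximate
south_africa_total <- 4.0e6  # South Africa, approximate
others_combined / south_africa_total          # 0.275
others_combined / south_africa_total < 1 / 3  # TRUE
```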

By Fig. 6,

  • The contrast in total deaths is even starker than in total cases when considering South Africa vs the rest, as South Africa’s death count is more than 5 times that of the others combined.

  • Total death counts of the non-South African countries are relatively similar, with Angola having the lowest, as was the case with total cases. Zambia, despite having the 2nd highest number of total cases by a decent margin, is only 4th highest in total death count.

  • South Africa continues to have the highest infection rate even when adjusted for population.
  • Namibia, despite having the 2nd lowest total COVID-19 cases among the countries compared, has an infection rate almost equal to South Africa’s. A similar but less drastic change is seen in Namibia’s death rate, which trails South Africa’s leading death rate by only approximately 0.15.
  • Zambia is 1 of 3 countries to have a higher infection rate than death rate, the others being Namibia and South Africa. Zambia also has the 2nd lowest death rate at approximately 1.2%, with only Mozambique lower at 1%.
  • Notably, Angola shows a significantly lower infection rate than the rest at just 0.3%. The next lowest is Mozambique at 0.7%, which is itself quite low, as the others are all at least a full percentage point higher.

2.4 - Zambia vs Some Countries from Around the World

Why the US, NZ, Sweden and Sri Lanka?

  • The US is the country with the highest total COVID-19 cases and deaths recorded in the world.
  • NZ, as its Government was known for its extremely swift imposition of lockdowns at the onset of the pandemic.
  • Sweden, as it was known for its limited measures, notably the absence of lockdowns.
  • Sri Lanka for added context.
  • Zambia’s death rate is the 2nd highest of the countries on the plot, behind only Sri Lanka’s, which is more than 2 times Zambia’s.
  • The US, despite having the highest COVID-19 case and death totals in the world, still has a lower death rate than Zambia.
  • All countries plotted show higher infection rates than death rates.

3.0 Conclusions and Discussion

3.1 - Conclusions

  • The spread of COVID-19 in Zambia (in terms of the timeline of daily cases and deaths) is consistent with that of its neighbours, except for Angola.
  • The neighbouring countries compared show confirmed case and death totals similar to Zambia’s, except for South Africa.
  • Zambia’s death rate is one of the lowest in the region.
  • Zambia hit its peak level of daily COVID-19 cases in late Dec 2021-early Jan 2022 and its peak in daily COVID-19 deaths in early July 2021.
  • Developed nations in the analysis such as Sweden, the US and NZ showed significantly higher infection rates than the African countries and Sri Lanka.

3.2 - Discussion

  • The missing values of recovered COVID-19 cases for Zambia narrow the scope of the analysis somewhat.
  • Low infection rates in some countries may be due to under-reporting of COVID-19 cases, especially since developed nations show significantly higher infection rates, possibly due to greater expenditure on and diligence in testing.
  • A potential shortcoming of the death rate statistic is that it depends on the number of confirmed infections, so such under-reporting may lead to inflated death rates.
  • Death rate differences between countries may be within some margin of random error.
  • Due to the lack of demographic/socioeconomic data in the analysis, the impact of factors such as income, education and age cannot be distinguished as reasons behind the observed differences.
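
The point about under-reporting inflating death rates can be illustrated with a short sketch. It assumes, purely for illustration, that deaths are fully reported while only a fraction of true infections is confirmed. The function name and all numbers are hypothetical and are not estimates for any country in this analysis.

```r
# Observed death rate (in %) when only a fraction of infections is confirmed.
# Assumes every death is reported; detection_fraction is between 0 and 1.
observed_death_rate <- function(deaths, true_infections, detection_fraction) {
  confirmed_cases <- true_infections * detection_fraction
  100 * deaths / confirmed_cases
}

# Hypothetical scenario: 1,000 deaths among 100,000 true infections.
observed_death_rate(1000, 100000, detection_fraction = 1.0)  # 1: all infections confirmed
observed_death_rate(1000, 100000, detection_fraction = 0.5)  # 2: half confirmed, rate doubles
```

Under these assumptions, halving the share of infections that are detected doubles the observed death rate even though the underlying severity is unchanged.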

4.0 References